Search CORE

11 research outputs found

Event Causality Identification with Causal News Corpus -- Shared Task 3, CASE 2022

Author: Caselli Tommaso
Hettiarachchi Hansi
Hürriyetoğlu Ali
Liza Farhana Ferdousi
Oostdijk Nelleke
Tan Fiona Anting
Uca Onur
Publication venue
Publication date: 01/01/2022
Field of study

The Event Causality Identification Shared Task of CASE 2022 involved two subtasks working on the Causal News Corpus. Subtask 1 required participants to predict if a sentence contains a causal relation or not. This is a supervised binary classification task. Subtask 2 required participants to identify the Cause, Effect and Signal spans per causal sentence. This could be seen as a supervised sequence labeling task. For both subtasks, participants uploaded their predictions for a held-out test set, and ranking was done based on binary F1 and macro F1 scores for Subtask 1 and 2, respectively. This paper summarizes the work of the 17 teams that submitted their results to our competition and 12 system description papers that were received. The best F1 scores achieved for Subtask 1 and 2 were 86.19% and 54.15%, respectively. All the top-performing approaches involved pre-trained language models fine-tuned to the targeted task. We further discuss these approaches and analyze errors across participants' systems in this paper.Comment: Accepted to the 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2022

arXiv.org e-Print Archive

Birmingham City University Open Access Repository

BCU Open Access

University of East Anglia digital repository

The Causal News Corpus: Annotating causal relations in event sentences from news

Author: Ameer Iqra
Caselli Tommaso
Hettiarachchi Hansi
Hu Tiancheng
Hürriyetoğlu Ali
Liza Farhana Ferdousi
Nomoto Tadashi
Oostdijk Nelleke
Tan Fiona Anting
Uca Onur
Publication venue: European Language Resources Association (ELRA)
Publication date: 01/01/2022
Field of study

Despite the importance of understanding causality, corpora addressing causal relations are limited. There is a discrepancy between existing annotation guidelines of event causality and conventional causality corpora that focus more on linguistics. Many guidelines restrict themselves to include only explicit relations or clause-based arguments. Therefore, we propose an annotation schema for event causality that addresses these concerns. We annotated 3,559 event sentences from protest event news with labels on whether it contains causal relations or not. Our corpus is known as the Causal News Corpus (CNC). A neural network built upon a state-of-the-art pre-trained language model performed well with 81.20% F1 score on test set, and 83.46% in 5-folds cross-validation. CNC is transferable across two external corpora: CausalTimeBank (CTB) and Penn Discourse Treebank (PDTB). Leveraging each of these external datasets for training, we achieved up to approximately 64% F1 on the CNC test set without additional fine-tuning. CNC also served as an effective training and pre-training dataset for the two external corpora. Lastly, we demonstrate the difficulty of our task to the layman in a crowd-sourced annotation exercise. Our annotated corpus is publicly available, providing a valuable resource for causal text mining researchers

arXiv.org e-Print Archive

Repository for Publications and Research Data

Proceedings - University of Groningen

University of Groningen

Birmingham City University Open Access Repository

ARTS repository - University of Groningen

BCU Open Access

University of East Anglia digital repository

Dissertations of the University of Groningen

The Causal News Corpus: Annotating Causal Relations in Event Sentences from News

Author: Ameer Iqra
Caselli Tommaso
Hettiarachchi Hansi
Hu Tiancheng
Hürriyeto˘glu Ali
Liza Farhana Ferdousi
Nomoto Tadashi
Oostdijk Nelleke
Tan Fiona Anting
Uca Onur
Publication venue
Publication date: 01/06/2022
Field of study

BCU Open Access

Constructing and Interpreting Causal Knowledge Graphs from News

Author: Koji Miura
Ng See-Kiong
Paul Debdeep
Tan Fiona Anting
Yamaura Sahim
Publication venue
Publication date: 16/05/2023
Field of study

Many jobs rely on news to learn about causal events in the past and present, to make informed decisions and predictions about the future. With the ever-increasing amount of news and text available on the internet, there is a need to automate the extraction of causal events from unstructured texts. In this work, we propose a methodology to construct causal knowledge graphs (KGs) from news using two steps: (1) Extraction of Causal Relations, and (2) Argument Clustering and Representation into KG. We aim to build graphs that emphasize on recall, precision and interpretability. For extraction, although many earlier works already construct causal KGs from text, most adopt rudimentary pattern-based methods. We close this gap by using the latest BERT-based extraction models alongside pattern-based ones. As a result, we achieved a high recall, while still maintaining a high precision. For clustering, we utilized a topic modelling approach to cluster our arguments, so as to increase the connectivity of our graph. As a result, instead of 15,686 disconnected subgraphs, we were able to obtain 1 connected graph that enables users to infer more causal relationships from. Our final KG effectively captures and conveys causal relationships, validated through multiple use cases and user feedback.Comment: Submitted to conferenc

arXiv.org e-Print Archive

Event Causality Identification with Causal News Corpus - Shared Task 3, CASE 2022

Author: Caselli Tommaso
Hettiarachchi Hansi
Hürriyetoğlu Ali
Liza Farhana Ferdousi
Oostdijk Nelleke
Tan Fiona Anting
Uca Onur
Publication venue
Publication date: 01/01/2022
Field of study

The Event Causality Identification Shared Task of CASE 2022 involved two subtasks working on the Causal News Corpus. Subtask 1 required participants to predict if a sentence contains a causal relation or not. This is a supervised binary classification task. Subtask 2 required participants to identify the Cause, Effect and Signal spans per causal sentence. This could be seen as a supervised sequence labeling task. For both subtasks, participants uploaded their predictions for a held-out test set, and ranking was done based on binary F1 and macro F1 scores for Subtask 1 and 2, respectively. This paper summarizes the work of the 17 teams that submitted their results to our competition and 12 system description papers that were received. The best F1 scores achieved for Subtask 1 and 2 were 86.19% and 54.15%, respectively. All the top-performing approaches involved pretrained language models fine-tuned to the targeted task. We further discuss these approaches and analyze errors across participants’ systems in this paper

University of East Anglia digital repository

The Causal News Corpus: Annotating Causal Relations in Event Sentences from News

Author: Ameer Iqra
Anting Tan Fiona
Caselli Tommaso
Ferdousi Liza Farhana
Hettiarachchi Hansi
Hu Tiancheng
Hürriyetoğlu Ali
Nomoto Tadashi
Oostdijk Nelleke
Uca Onur
Publication venue: European Language Resources Association (ELRA)
Publication date: 01/06/2022
Field of study